PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Thecc1EG037207t1
Common Name TCM_037207
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HD-ZIP
Protein Properties Length: 881 aa    MW: 96806 Da    pI: 6.0874
Description HD-ZIP family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG037207t1  genome  CGD     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  58.7   9.9e-19  61     119  3          57
                       --SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHC....TS-HHHHHHHHHHHHHHHHC CS
          Homeobox   3 kRttftkeqleeLeelFeknrypsaeereeLAkkl....gLterqVkvWFqNrRakekk 57 
                       k  ++t+eq+++Le+l++++++ps  +r++L +++    +++ +q+kvWFqNrR +ek+
  Thecc1EG037207t1  61 KYVRYTPEQVDALERLYHECPKPSSMRRQQLIRECpilaNIEPKQIKVWFQNRRCREKQ 119
                       5679*****************************************************97 PP

2    START     185.4  3.1e-58  205    413  2          205
                       HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS.SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-SEEEEEEEECTT..EEEE CS
             START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskvdsgealrasgvvdmvlallveellddkeqWdetlakaetlevissg..galq 92 
                       +aee+++e+++ka+ ++  Wv+++ +++g++++ +++ s++++g a+ra+g+v  +++  v+e+l+d++ W ++++++++++v+s+g  g+++
  Thecc1EG037207t1 205 IAEETLTEFLSKATGTAVEWVQMPGMKPGPDSIGIVAISHGCTGVAARACGLVGLDPT-RVAEILKDRPSWFRDCRAVDVMNVLSTGngGTIE 296
                       789*******************************************************.8888888888************************ PP

                       EEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--....-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXXHHHH CS
             START  93 lmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe...sssvvRaellpSgiliepksnghskvtwvehvdlkgrlphwll 181
                       l +++l+a+++l+p Rdf+ +Ry+  l++g++v++++S++++q+ p+    +++vRae+lpSg+li+p+++g+s +++v+h+dl+ ++++++l
  Thecc1EG037207t1 297 LLYMQLYAPTTLAPaRDFWLLRYTSVLEDGSLVVCERSLNNTQNGPSippAANFVRAEMLPSGYLIRPCEGGGSIIHIVDHMDLEPWSVPEVL 389
                       **********************************************9999******************************************* PP

                       HHHHHHHHHHHHHHHHHHTXXXXX CS
             START 182 rslvksglaegaktwvatlqrqce 205
                       r+l++s++  ++kt++a+l+++++
  Thecc1EG037207t1 390 RPLYESSTLLAQKTTMAALRHLRQ 413
                       *******************99876 PP
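
The alignment blocks above appear to follow an HMMER-style layout (consensus structure, model row, match row, target row, posterior probabilities). As a rough illustration of how to read them, the sketch below counts identical residues between the model row and the target row of the Homeobox alignment. The function name `alignment_identity` and the gap conventions ('.' for an insertion relative to the model, '-' for a deletion in the target) are assumptions made for this example, not something the record itself states.

```python
# Model (HMM consensus) and target rows copied from the Homeobox alignment.
MODEL = "kRttftkeqleeLeelFeknrypsaeereeLAkkl....gLterqVkvWFqNrRakekk"
TARGET = "KYVRYTPEQVDALERLYHECPKPSSMRRQQLIRECpilaNIEPKQIKVWFQNRRCREKQ"


def alignment_identity(model, target):
    """Return (identical, aligned) over columns where both rows hold a residue.

    Assumption: '.' in the model row marks an insertion in the target and
    '-' in the target row marks a deletion, as in HMMER-style output.
    """
    if len(model) != len(target):
        raise ValueError("alignment rows must have equal length")
    identical = aligned = 0
    for m, t in zip(model, target):
        if m == "." or t == "-":
            continue  # gap column: not counted as aligned
        aligned += 1
        if m.upper() == t.upper():
            identical += 1
    return identical, aligned


identical, aligned = alignment_identity(MODEL, TARGET)
print(f"{identical}/{aligned} identical ({100 * identical / aligned:.1f}%)")
```

For the Homeobox rows this reports 22 identical residues over 55 aligned model positions, which agrees with the number of letters shown in the match row.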

Protein Features
Database         Entry ID           E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50071            15.758    56     120  IPR001356    Homeobox domain
SMART            SM00389            6.8E-16   58     124  IPR001356    Homeobox domain
CDD              cd00086            8.39E-17  61     121  No hit       No description
SuperFamily      SSF46689           8.55E-17  61     124  IPR009057    Homeodomain-like
Pfam             PF00046            2.6E-16   62     119  IPR001356    Homeobox domain
Gene3D           G3DSA:1.10.10.60   1.0E-18   63     119  IPR009057    Homeodomain-like
CDD              cd14686            1.22E-6   113    152  No hit       No description
PROSITE profile  PS50848            26.368    195    423  IPR002913    START domain
CDD              cd08875            7.06E-84  199    414  No hit       No description
SuperFamily      SSF55961           5.49E-38  204    416  No hit       No description
SMART            SM00234            1.5E-44   204    414  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20  4.8E-23   204    410  IPR023393    START-like domain
Pfam             PF01852            9.6E-56   205    413  IPR002913    START domain
Pfam             PF08670            4.7E-51   739    879  IPR013978    MEKHLA
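
Several databases report overlapping hits for the same region of the protein. One way to get a non-redundant view is to merge overlapping Start/End intervals; the sketch below does this with the coordinates as read from the feature rows above. The helper name `merge_overlapping` is hypothetical, and this is only an illustrative post-processing step, not a description of how PlantTFDB itself summarizes features.

```python
# (start, end) pairs as read from the Start and End values of the feature rows.
HITS = [
    (56, 120), (58, 124), (61, 121), (61, 124), (62, 119), (63, 119),
    (113, 152), (195, 423), (199, 414), (204, 416), (204, 414),
    (204, 410), (205, 413), (739, 879),
]


def merge_overlapping(intervals):
    """Merge 1-based inclusive intervals that share at least one position."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


regions = merge_overlapping(HITS)
print(regions)
```

With these coordinates the hits collapse into three regions: 56-152 (the homeodomain hits plus the adjacent cd14686 hit), 195-423 (START) and 739-879 (MEKHLA).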
Gene Ontology
GO Term     GO Category         GO Description
GO:0008284  Biological Process  positive regulation of cell proliferation
GO:0009733  Biological Process  response to auxin
GO:0010067  Biological Process  procambium histogenesis
GO:0010072  Biological Process  primary shoot apical meristem specification
GO:0010089  Biological Process  xylem development
GO:0045597  Biological Process  positive regulation of cell differentiation
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
GO:0008289  Molecular Function  lipid binding
Sequence
Protein Sequence    Length: 881 aa
MLIAAEGTGK TKAKATKNLE WIFEVSPVIR SCPKVPEENE VMMAVTSSCK EGNKIAMDNG  60
KYVRYTPEQV DALERLYHEC PKPSSMRRQQ LIRECPILAN IEPKQIKVWF QNRRCREKQR  120
KEASRLQAVN RKLTAMNKLL MEENDRLQKQ VSQLVYENSY FRQQTQNATL ATTDTSCESV  180
VTSGQHHLTP QHPPRDASPA GLLSIAEETL TEFLSKATGT AVEWVQMPGM KPGPDSIGIV  240
AISHGCTGVA ARACGLVGLD PTRVAEILKD RPSWFRDCRA VDVMNVLSTG NGGTIELLYM  300
QLYAPTTLAP ARDFWLLRYT SVLEDGSLVV CERSLNNTQN GPSIPPAANF VRAEMLPSGY  360
LIRPCEGGGS IIHIVDHMDL EPWSVPEVLR PLYESSTLLA QKTTMAALRH LRQISQEISQ  420
PNVTGWGRRP AALRALSQKL SKGFNEAVNG FTDEGWSMLE SDGVDDVTLL VNSSPGKMMG  480
INLSYSNGFP SMGNAVLCAK ASMLLQNVPP AILLRFLREH RSEWADSGID AYSAAAVKAG  540
PCSLPVSRGG SFGGQVILPL AHTIEHEEFM EVIKLENMGH YRDDMIMPGD IFLLQLCSGV  600
DENAVGTCAE LIFAPIDASF SDDAPIIPSG FRIIPLDSGM DASSPNRTLD LASTLEVGAA  660
GNRATGDHSG RCGSTKSVMT IAFQFVYEIH LQENVATMAR QYVRSIIASV QRVALALSPS  720
RFGSLADFRT PPGTPEAQTL GRWICDSYRC YLGVELLKNE GSESILKMLW HHTDAVLCCS  780
LKALPVFTFA NQAGLDMLET TLVALQDISL EKIFDENGRK ALFAEFPQVM QQGFMCLQGG  840
ICLSSMGRPV SYERAVAWKV VNDEENAHCI CFMFINWSFV *
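
The sequence is displayed in ten-residue blocks with a running position counter at the end of each row. The sketch below shows one way to turn that display back into a plain string and to cut out a region using the 1-based, inclusive coordinates from the Signature Domain table. Only the first two rows are embedded to keep the example short; the helper names `join_blocks` and `region` are made up for this illustration.

```python
import re

# First two rows of the protein sequence, copied as displayed
# (ten-residue blocks followed by a running position counter).
RAW_ROWS = """\
MLIAAEGTGK TKAKATKNLE WIFEVSPVIR SCPKVPEENE VMMAVTSSCK EGNKIAMDNG  60
KYVRYTPEQV DALERLYHEC PKPSSMRRQQ LIRECPILAN IEPKQIKVWF QNRRCREKQR  120
"""


def join_blocks(raw):
    """Drop counters, whitespace and any trailing '*'; return one string."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def region(seq, start, end):
    """Slice with 1-based, inclusive coordinates as used in the tables."""
    return seq[start - 1:end]


seq = join_blocks(RAW_ROWS)
homeobox = region(seq, 61, 119)  # Homeobox row of the Signature Domain table
print(len(seq), homeobox)
```

The extracted segment can be compared against the target row of the Homeobox alignment as a quick consistency check on the coordinates.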
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_007012152.1      0.0      Class III HD-Zip protein 8 isoform 1
Swissprot  Q39123              0.0      ATHB8_ARATH; Homeobox-leucine zipper protein ATHB-8
TrEMBL     A0A061GJS3          0.0      A0A061GJS3_THECC; Class III HD-Zip protein 8 isoform 1
STRING     POPTR_0006s25390.1  0.0      (Populus trichocarpa)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM20222678
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G32880.1  0.0      homeobox gene 8
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]